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Abstract. We study the time that the simple exclusion process on the complete graph 
needs to reach equilibrium in terms of total variation distance. For the graph with 
n vertices and 1 <C k < n/2 particles, we show that the mixing time is of order 
|n log min(fc, y^n), and that around this time, for any e, the total variation distance 
drops from 1 — e to e in a time window of whose width is of order n (i.e. in a much 
shorter time). Our proof is purely probabilistic and self-contained. 
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1. Introduction 

Let G = (V,E) be a finite connected graph and 1 ^ k ^ |V| — 1 an integer. We define 
a configuration as an element of rj G {0, 1} with k ones and |V| — fc zeros (ones can be 
considered as particles moving on the graph). The simple exclusion process on the graph 
G with k particles, can be described as follows: One starts with a given configuration and 
at each time step, one chooses an edge e uniformly at random in E and one interchanges 
the contents (zero or one) of the two vertice adjacent to e. This Markov chain is reversible 
and has the uniform measure over all configurations as equilibrium measure. 

In this paper we study the rate of convergence to equilibrium of the exclusion process 
on the complete graph with n vertices, k(n) particles, and k going to infinity with n . 
For obvious symmetry reasons, one can restrict the problem to the case k ^ n/2 without 
any loss of generality. For the sake of clarity we give a formal definition of the exclusion 
process 

The configuration space is 

n(n, jfc(n)) := [v G {0, l}^^ | ^ v (x) = fc(n)}. (1.1) 

ire[l,n]nN 

We consider the discrete-time Markov chain on 0(n, k(n)) that at each time step, inde- 
pendently selects two vertice uniformly at random in {1, ...,n} and interchanges their 
contents (emptiness or particle). Note that with probability 1/n the two chosen vertices 
are the same: in that case nothing happens. We give now the transition kernel of this 
process: The symmetric group S n acts transitively on Vt{n,k{n)) in a natural way. If 
a G S n then 

[a.^x)-j\{a-\x)\ (1.2) 
and for rj, rj G 0(n, k(n)), n ^ if, the transition rates are given by 

jd( '\ — \ TP if = T -V for some transposition r , . 

n»7; »7 J - | o if it is not the case ^ } 

One can check that this implies P{j],rj) ^ 1/2. 

l 
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This Markov chain is reversible, aperiodic and its equilibrium measure is the uniform 
measure on Q(n, k{n)) that we denote by ir. Given an initial configuration £ G Q(n, k(n)), 
one writes P^ (and denotes the associated expectation) for the law of the Markov chain 
(?7t)t Ss o started from the configuration rjo = £ and [if for the marginal distribution of P^ 
at time t (t € N). 

We study the convergence to equilibrium of this chain. Distance to equilibrium is given 
by the following quantity 

dW(t):= max ||^-vr||. (1.4) 
fen(n,fc(n)) 

where || • || denotes the total variation distance: for two measures on Q(n,k(n)) 

\\^-A-=\ Iriv) - n(v)l ( L5 ) 

r?6f2(n,fc(n)) 

The main result of this paper is a sharp estimate of the time needed to reach equilibrium. 

Theorem 1.1. If lim n _K>o k(n)/y/n = oo, then for every e > there exists j3 > such 
that for all n 



d^ (^n log n + fin^j ^ e 

log n - (3n^j ^ 1 — e. 



(1.6) 



i/'lim n _ s . 00 k{n)/y/n = 0, and lim n _ s>00 /c(n) = oo, i/ien /or every e > i/iere exists (3 > 
suc/i i/iai /or a// n 



d (n) Qralog£;(n) + /3n^ ^ e 

Qnlogfc(n) - /3n \ ^ 1 - e. 



(1.7) 



Ifk(n)/y/n^rl G (0,oo) ; f/ien flL6|) a™d ([LTD /ioM. 

The function i 1— )■ d^™) (t) is non-increasing. Thus for any e one can set 

r££(£) := inf{i | <*(")(*) < e} = sup{t | d(")(t) > e}. (1.8) 

From the above theorem one has that for any e > 

T£L(e) - Tit(l - s) = £ (n) = o £ (T mix ), (1.9) 

where O e and o e underline dependence in e. In words: the time in which the distance to 
equilibrium drops from close to one to close to zero is at most of order n and much smaller 
than T m ; x . This phenomenon is known as cutoff. It was first identified by Diaconis and 
Shashahani [7] for the random walk on the symmetric group generated by transpositions 
(see also [2] for a recent extension of this result with a probabilistic proof), and was given 
its name in the celebrated paper of Aldous and Diaconis [IJ where it is shown that cutoff 
occurs for top-to-random card shuffle. The bound that is obtained for T^(e) — T^(l — e) 
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(in the present case, 0(n)) is often called the cutoff window . Our result is optimal in the 
sense that 0(n) is the best window one can obtain: it will be shown in the proof that 

T (n) (e) -T (n) (l -e) 
limliminf mixV ' ^ '- = oo. (1.10) 

The simple exclusion process on the complete graph maps to another problem: the 
Bernouilli Laplace Diffusion Process. In [8j, Diaconis and Shashahani studied this model 
and, using purely algebraic methods, proved cutoff in the case k(n) = n/2. Their method 
should be extendable to some other values of k (e.g. using the same method Donnelly, 
Lloyd and Sudbury [3] extended the result to the case where k = an for some a G (0, 1)), 
but it clearly fails to give the right result when k(n) <C yjn (e.g. the upper-bound given in 
Theorem 2 fails to be sharp in that particular case). We also underline that the methods 
we present here are purely probabilistic. 

The simple exclusion process on the complete graph can be seen as a projection of the 
random walk on the symmetric group generated by transposition, and therefore the mixing 
time for simple exclusion is always smaller than the mixing time for random transposition. 
What our result underlines is that while the spectral gaps (this a general result that holds 
for every graph, see [3j) for the two processes are the same, the mixing time differ: the 
mixing time for the random transposition model is n/21ogn(l + o(l)), whereas the mixing 
time for the exclusion process is at most n/41ogn(l + o(l)). This result is specific to the 
complete graph: for the exclusion process on the segment or on the circle, mixing time 
for exclusion process with a density of particle and interchange process are expected to 
coincide [14] . 

Let us also compare the mixing time of the simple exclusion process with the mixing 
time of the simple exclusion process with k labeled particles: the space of configurations 
is 

fi'(n, k(nj) := {7/ € {0, 1, ... , k}^ m \ Mi G {1, . . . , k} 3\x, n{x) = i\ . (1.11) 

The rules for the evolution are the same: at each time step, one chooses two vertice at 
random and interchanges their contents. As there is no risk of confusion we use for the 
labeled process the same notation as that for the exclusion process. The equilibrium 
measure ir for this process is the uniform measure over £l'(n, k(n)). What we can show is 
that if k <C yfn then the mixing times of labeled and unlabeled exclusion process coincide 
but that they differ as soon as k 3> yjn. We suspect that for every value of k and every e 
one has 

77 

r mix (e):= -log k + 0(n). (1.12) 

We prove in the following that 

Theorem 1.2. For the exclusion process on the complete graph with n vertice and k labeled 
particles, for every e, there exists (3 > such that for every k and n 

d( n ) (ln\ogk-Pn\ ^ 1-e. (1.13) 

Moreover z/lim n _ s . 00 k(n)/y/n = then for every e, there exist (3 > such that for all n 

d {n) Qnlogfc(n) + (3n^j < e. (1.14) 
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The mixing time of the simple exclusion process has been studied for some other graphs 
than the complete graph. However, to our knowledge, cutoff has not been proved for any 
other graph. We refer to [14] for a study of the mixing time of the simple exclusion process 
on the segment {0, . . . , n} (the edges of the graph are the (k, k + 1), k E [0, n — 1]), p2] 
for simple exclusion on the d-dimensional torus, and [13J for a recent general study of the 
exclusion process mixing time. 

The sequel of the paper is organized as follows 

• In Section [21 we reduce the study of the the unlabeled exclusion process to the 
study of a birth of death chain, which is a first step towards the proof of Theorem 

• In Section [3l we prove Theorem II .11 in the case of small k. 

• In Section^ we prove Theorem 11.21 

• In Section [5l we prove Theorem II .11 in the case of large k. 



2. Reduction to the study of a birth and death chain 

Our Markov chain is a lazy simple random walk on a transitive graph. Therefore, by 
transitivity, the distance — 7r|| does not depend on the initial configuration £. We can 
set ?7o to be 

i \t \ J 1 if x G [1, k{n)\ , , 

fo>)(*):=| o ifnot (2-1) 

and we simply write P - and E for the associated expectation - (resp. //+.) for the law of 
(Vt)t > o (resp. r]t) starting from this configuration. 

We now claim that for every t, [it is invariant under permutations of the coordinates in 
{1, . . . , k(n)} and in {k(n) + 1, . . . , n}. This is obviously true for t = 0, and this remains 
true for t > as the dynamic itself is invariant under these permutations. Therefore, if 
one sets 

k(n) 

x=l \ ' ' 

W t :=W(th). 

then for any m and t such that [i(Wt = m) > 0, //t(-|W = m) is the uniform measure over 
all the configuration r\ such that W(rj) = m. 

Let fit and tt be the law of W under m and ir respectively. The preceding remarks 
imply that 

d(t) = \\fit - 7f t || (2.3) 

One can check that the evolution of Wt is Markovian. Our problem is now confined to the 
study of the mixing time of this new Markov chain, which is what is called a birth and 
death chain on {0, ... , k(n)}. 

We write ji\ for the law of Wt starting from Wq = i. As (Wt)t > o is a projection of the 
Markov chain (rjt) ^ o> ° ne has 

Wik ~ ^11 = max \\iif — ir\\ ^ max \\u\ — 7f|| ^ 11/2+ — vfll, (2.4) 
esn(n,Jfc(rO) t6{l,...,Jfc} 

that is to say that for any value of t, Wt is farther from the equilibrium measure if Wq = k. 
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Remark 2.1. At this point of our analysis, one can already show that there is cutoff for 
our process. Indeed, the cutoff phenomenon for general birth and death chain has been 
studied in [S], in which the authors prove that T rc \ = o(T m ; x (l/4)) is a necessary and 
sufficient for having cutoff (where T re i, the relaxation time is by definition the inverse of 
the spectral gap). This condition can be checked rather easily in our case. However, one 
cannot get the location of the cutoff, nor the correct order for the size of the window by 
using only this general result. 

We use the notation P to denote the transition probability of (Wt)t > o- For the sake of 
clarity, we often omit the dependence in n in the notation. We have 

x 2(k-i) 2 

F(M - 1) = 2i( " " f + l \ (2-5) 
n z 

P( . n 2 - 2[(k - i) 2 + i{n - 2k + i)} 

n z 

We end this section with a first simple Lemma giving the expectation of W%. It will 
constantly be used in the sequel. 

Lemma 2.2. One has, for any value of k, for any t 

Et[W t ] = (w(0 - - -) + - (2.6) 



n J \ n J n 

Proof. Using the jump rates we compute the expected value of Wt+i given Wt- One has 

K^[Wt + i\Wt} =W t + P(W t , Wt + l)- P(W t , Wt-l) = —+(w t - —) ( 1 - -V (2.7) 

n \ n J \ n J 

Taking the expectation on both sides, and making a trivial induction, one gets the desired 
result. □ 

3. The case k(n) < n 1 / 2 
In this section we prove the main theorem with the assumptions that lim^—^oo 

k{n)/y/n = 

and lim n _ s . 00 k{n) = oo. 

3.1. Upper bound on T m \-^. Under the assumption that linL^—^QQ — — 0, Tt(W = 0) = 

1 — o(l). Indeed, with this condition, the expectation of W at equilibrium is 

it(W) = = (i). (3.1) 

n 

Let us choose 7 > and set tj = - log k(n) + 771 (is has to be thought as the integer 
part, but we omit this in the notation to keep things simpler; at any rate it would not 
change the proof). One has 

/ 2 \ 5 log k +~i n 

jl t JW) = k(l-~) +o(l) (3.2) 



n 



Altogether we get that 
And therefore 



|| At, - 7f||< e-"* + o(l). (3.3) 

77 

t (e)<-(logA;(n)-loge + o(l)). (3.4) 
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□ 

3.2. Lower bound on T m [ x . To get the other bound, we make the following consideration: 
the equilibrium measure tt is concentrated on the event {W = 0}. Therefore, on the 
original exclusion process, every particle has to be moved at least once in order to be 
significantly close to equilibrium. To formalize this properly, we present an alternative 
construction of the simple exclusion process. 

Let (Xt, Ytjt > l be a sequence of i.i.d. random variables distributed uniformly on {1, ... , n} 2 
(we include this process in the probability law P). Under E^, we start from r]Q = £ and we 
build rjt from rjt-x by interchanging the content of sites Xt and Yt if Xt ^ Yt: 

( m -i(x) ifx${X t ,Y t }, 
m (x) := { Vt (Y t ) ifx = X t , (3.5) 
[ Vt (X t ) ifx = Y t . 

We define r as the time were all the sites in {0, . . . , k(n)} have been selected at least 
once by the process (X, Y) 



t := inf \ t > | \J{X S ,Y S } D {1, . . . , k(n)} 



(3.6) 



Notice that if t < r, then W% ^ 0, so that 

\\fit-Sw=o\\>^[r>t} (3.7) 

Therefore one has 

||fo-7f|| >P[r>i]-o(l) (3.8) 

Estimating the time r boils down to the so-called coupon collector problem (see |10j . 

Section 2.2 in particular). Set X' 2s _ 1 := X s and X' 2s = Y s . Then X' s is an i.i.d. sequence 
One has the following equality in law 

k 



T = \r'/2] 



1=1 



where 



t' :=inf{^0 \{X' s ,s = l,...,t}D{l,...,k(n)} }. 
and the £{ are defined by 



(3.9) 
(3.10) 



:=inf{t^0 \#({X' s \s=l,...,t}n[l,k{n)}) = i) . 



(3.11) 



It is not difficult to check that {£i)i^\\M are independent geometric variables of mean 
( fc_" + i )ig[i,fc]- From this, one gets the following moment estimates: for some constant C 



E 



Varp 



E* 

i=i 



^ ralog — Cn, 



.i=i 



1 - (i/n) 



(3.12) 



< Cn 2 . 
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Therefore, if one chooses by := ^nlog k — 772 (suppose that this in an integer), one has by 
classical second moment inequality: 

k 



i=l 



c 



And therefore from (|3.8p and (|3.9p 

d(tj) > 1 

and hence, for any e > 

n 



C 



(27 -c? 



(27 -c) 2 



(3.13) 



T mix (l -e)>- (log k(n) - VC/e -C + o(l) J . 



(3.14) 



(3.15) 



(One could get a tighter bound with log e instead of — e by using exponential moments 
instead of second moment). □ 

4. Bounds for the labeled process 

The methods of the previous section can be applied for the proof of Theorem 11.21 
For the lower-bound we remark that at equilibrium, for any i 6 {1, . . . , k}, 7r(ji(i) = i) = 

—. Thus in every case the expected number of fixed points is less than one: 7r(#{i|r/(i) = 

i}) ^ 1, and for any integer K 

Tr(#{i\r](i) =i}>K)^ l/K. (4.1) 

One constructs the exclusion process from (X t ,Y t ) t ^ q as in the previous section. We 
define r as the first time at which all of the k first sites have been selected. 

r:=inf jt^O I #((|J {X s , Y s }) n {1, fc(n)}) ^ k - if J . (4.2) 

If one starts the process from 

(Vo)(x) ■= 

One has 



x if x € [1, k(n)] 
if not 



fJp(#{i\v(i)=i}>K)>P(t>T) 



and therefore 



||/j?»-7r||>P(t>T) 



K 



Taking the same definition for Si as in the previous section one has 

k-K 

1=1 

and £i are independent geometric variables of respective mean f 

-k-K 



(4.3) 

(4.4) 
(4.5) 

(4.6) 



One has: 



E 



Varp 



" k 



i=i 



^ n log — n log K — Cn, 



(4.7) 



i=if 



8 



HUBERT LACOIN AND REMI LEBLOND 



and therefore, using Chebychev inequality one gets: 

nr^- 2 lo g k-nlo g K)^ K{lQg C K _ c)2 . (4.8) 
Overall, this gives that for K sufficiently large and for t = ^ log k — n log K: 

\\HT-A>1-^. (4.9) 

which gives 

Tl 

T mix (l-e) > -logfc + nloge/2. (4.10) 

For the upper bound, we assume that lim n _ 5>00 k(n)/^/n = 0. We notice that as for the 
unlabeled process, the distance to equilibrium is the same for every starting position by 
symmetry. 

d(t) = \\tf -tt||. (4.11) 

Let W be the number of particle lying on the vertices {1, . . . , k}. (W(rj) := Ylx=i l»?(x)^o)- 
Now notice that for every t ^ k one has 

H?(-\W = 0) = tt(-\W = 0), (4.12) 

as the initial condition and the dynamics are invariant under permutation of {k(n) + 

1. . . . ,n}. Therefore the same analysis as in section [3~T1 gives 

Tb 

T mix (e) ^ - (log k{n) - log e + o(l)) . (4.13) 

5. The cases k(n) > s/n and k(n) 

In this section we prove the main theorem with the assumption that either 
linin-j.oo k(n)/n = oo or lirrin^oo k(n)/n = I > 0. The latter case is a bit more complicated 
than the other as the distribution of W at equilibrium is asymptotically non-degenerate. 
One makes use of second moment arguments for the lower bound of the mixing time, and 
a diffusion argument to get an upper bound. 

5.1. Lower bound on the mixing time. In this section we work with the weaker 
assumption liminfn^oo k(n)/^/n > 0. To get the right bound on the mixing time one uses 
a second moment method. The first and essential step is to compute a tight estimate of 
fit [W 2 ~\ . We start by writing an explicit formula for the second moment of Wt ■ 

Lemma 5.1. 

E[^ 2 ]=(l-- + 4)^ 2 
\ n n A 



1 ' '4k 2 8k 2A_ rTI _ 1 2k 2 
— - — + -) E [W s ] + 





4 y- 1 - 






y n 





(5.1) 

=o 7 J v 7 

Proof. One simply uses the transition of the Markov chain to get a recurrence relation 

n\V 2 +1 \W t ] = W 2 + 2W t (P(W t , W t + l)- P(W t , W t -l)) + P(W t , W t + 1) + P(W t , W t -l). 

(5.2) 

By taking the expectation on both sides, one gets the following recursive relation 

ego = ( i - - + 4) nw?\ + (^ r - 8 4 + -) nwt\ + -■ (5.3) 

which after induction gives the expected result. □ 
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Then, using the formula above, we get a clean bound on the variance of Wt- 

Lemma 5.2. There exists a constant C such that for any K, n large enough (depending 
on K), andt = § logn- 3p, 7 G [~K,K] 



_ _ ...n, Ck &^ k 

Var P (Wt 2 ) ^ + -=. 

n \Jn 



(5.4) 



Proof. Suppose that t = ^(logn — 27) for some 7 € [— if, K\. All the O(-) are uniform in 
7 G [—if, if] when n is large enough. We start by giving an estimate of the expectation 
squared 



[E(W)] 2 = 



k 2 / fc 2 
+ fc 



n 



n 



1 

2n 



fc 2 / fc 2 \ . 
— +(k )-= ( l + O 



logn 



n 



= 4 + ^7? f fe " -1 e7 + " _1 f fc " - V e ' 7 + °( fc2 /^)- (5-5) 
n z n 6 ' 2 \ n J \ n J 

Estimating E(W 2 ) is a bit more tricky. For practical reasons we divide it in three terms 



2k 2 



^ V 1 n + n 2 ) k + n 2 ^-i\ n + n 2 



s=0 



t-l-s 



+E 

s=0 



4k 2 8k 



_ + _ E (W S ) 1 - - + - 
n z n / \ n n z 



t-l-s 



(5.6) 



The first one gives 



n rr 



fc 2 e 2 ^ 



n 



+ 0(k 2 /n). 



(5.7) 



The second: 



2 A; 2 



t-i 



s=0 



4 4 

- + — 

n rr 



t-l-s 



= 0{k 2 /n). 



(5.8) 



We divide the third term it into two contributions 



t-i 

E 

s=0 



\k 2 8k 



n 2 n 



~ -o + - E(W S ) 1 - - + 



n rr 



t-l-s 



4/c 2 8k 



n 2 n 2 



n 



_ + - \k-- 



n 



t-i t-i 



n + n 2 ) S ( j _ 4 



4/c 2 8fc 2 \ /c 2 



n 2 n 2 n I n 



s=0 

t-i 



E 1 



s=0 



+ ^ 



- + - 

n n z 



(5.9) 
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The first contribution can be estimated as follows 



^ + l + (k/n 2 



The second is equal to 



k 2 \ e 2 ~< 



n I n 



1 + 



logn\\ yfne 7 — 1 + 0(n 1 / 2 logn) 



n 



1 + 0(0 



Ak 2 



2k 2 
n 3 /2 



k 



k- 



— +0(l/n) - 1 + - 



n 



2 7 



=7 _ 



2k 2 



e 7 fc 



fc- — ) e 27 + ^= + 0(k 2 /n). (5.10) 
n / Vn 



n 



n 



™ p 2-y2_ , nf 1.2 



+ 0{k 2 /n). 



Summing everything up gives the expected result. 



(5.11) 
□ 



Now we use the bounds that we have on the second and first moment to bound the 
mixing time. Set t = ^(logn — 27). Let (W±, W 2 ) be a maximal coupling between £i t and 
7f, and let v t be its law (such that \\fit — 7f || = vt{W\ 7^ W 2 )). 

One has 



VtiWi - W 2 ) 2 = v t {W x / W 2 }u t ((Wi - W 2 ) 2 I Wi / W 2 ) 

^ p t {^ jl w 2 } [v t ((W! - w 2 ) \w 1 ^w 2 )] 2 = ^ {]]l ~ ]] ' 2)r 



And hence 



~ 7r|| ^ 



MWi - W 2 )]' 
**(Wi - W 2 ) 2 



1 



1 + 



Var^(VKi-W2) 
Wt(W 1 -W 2 )] 2 



It is then easy to compute the moments. The expectation is given by 

(1 + 0(1)). 



vt{W x - W 2 ) = jl t (W) - tt(W) = e^(k-—) 
For the variance, first notice that: 



Var ff (W) = *W)vti)) = k*(ri(l)) + k(k - l)it( V (l) V (2)) - ^ < 
i,3=l 



k 2 



n 



And therefore 



VarP t (Wi - W2) 2 < 2[Var At (^) + Var # (W)] < C— + ^J. 

n v n 



(5.12) 
(5.13) 

(5.14) 

(5.15) 
(5.16) 



Hence one gets 



Ia 4 * - M\ ^ 



1 + 



r" fe 4- 



r 1 



v 



,. ( L+4C V 27 + ^V 7 + (1); . 



(5.17) 

where in the last line one used k ^ ^. Using the assumption that liminf k(ri)/^/n > 0, 
one obtains that for n large enough 



\fi t -Tt\\^ (l + // max(e- 27 ,e- 7 )+o(l)) 1 



(5.18) 
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Therefore 

... M . flogn-flog(^) iieXl + C")- 1 

5.2. Upper bound on T rpj y ■ To give sin upper bound, on the mixing time, we bound, the 
following quantity 

d(t)= max M-fi y t \\>d(t). (5.20) 

We define a coupling of two replicas of the Markov chain W starting from different states 
as follows: let (W t \wj ) be the Markov chain on {l,...,/c} 2 given by the following 
transition 



P((i,j),(i±l,j)) 
P((i,i),(t,j±l)) 

P((i,i),(i±i,i±i)) 
P((i,i), (*,./)) 

P((i, »),(»,»)) 



P(M±1), if M j, 
P(t,i±l), if^i, 

P((i, t±l), (5.21) 

P(i,i) + P(i,i)-1, 

P(i,i), 



where all the other transitions have zero probability. One can check that the coefficients are 
positive and that this indeed defines a stochastic matrix. This coupling has the property 
that once 

and merge, they stay together. Moreover, before the merging, at most 
one of the two coordinates changes at each time step, and therefore the sign of wM-w® 
is constant in time (i.e. and cannot cross without merging). With our choice 

for initial condition, in the sequel it is always non-negative. 

If one denotes by P x,y the law of (W t , wj~ ) with initial condition (x,y), x ^ y and 
defines 

r := inf{t > | W^ 1} = wf 1 }, (5.22) 

then 

\\fl^-fl y t \\^P^[r>t]. (5.23) 
Set D t := W t (1) - wf 1 > 0. According to Lemma O 

E**[A] = (x-y)(l-^y. (5-24) 

A first moment analysis is enough to treat the case lim n _> 00 k(n)/y/n = I. Indeed for any 
x and y 

t 



(5.25) 



P*.w [ T > t] = P x ' y [D t > 1] < E x >v[D t ] < ZVn(l + o(l)) ^1 - ^ 

Setting ^ = |nlogn + /3n one gets 

p** [ T > tp] = le~ 2 P(l + o(l)). (5.26) 

and therefore 

T mix (e) < -nlogn- -log(e/l). (5.27) 
The rest of the section is therefore devoted to the case lin^^oo k(n)/y/n = oo. 
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Given that W^p > W^ 2 \ Dt has the following transition probabilities 

p(A+i = A - 1 1 w t {1 \w t {2) ) = P(w t {1 \ w t {1) - 1) + P(wf\ w t {2) + 1) 

_ W t (1) (n - 2fc + + (fc - ^ (2) ) 2 

— 2 5 5 

P(A+i = A + 1 I wf \ W t (2) ) = P(W t (1) , W t (1) + 1) + P(wf \ W t (2) - 1) 

, (fc - wf } ) 2 + W t (2) (n - 2A; + iy, (2) ) 



(5.28) 



2- 



n 2 

One can check that the evolution of D is not Markovian (it depends on the values of 

(2) 

and W t and not only on A)- This makes the analysis of r difficult. We now sketch the 

method we use to tackle this problem: 

• First, we use a first moment method to show that after a time to := § logn, with 

probability close to one 

• Then, we do a sequence of stochastic comparisons to state that starting from 
^to — ^to ^ k/ y/n, t is stochastically dominated by (to plus) the hitting time 
of zero for a simple symmetric random walk on Z with jump rate k 2 /re 2 . 

• Finally, we use a reflection argument to show that the typical time for hitting zero 
when starting from k/y/n for such a walk is of order n. Altogether this gives that 
t is smaller than to + Kn with probability close to one if K is sufficiently large. 

The idea of combining a first moment method and diffusion in order to evaluate mixing 
times has already been used to compute the mixing time of the mean field Ising model 
in [9], the method was then refined in [6j. The interest of the method here lies in the 
particular manner the coupling has to be constructed. 

Set t a = j logn + an. One uses Markov property to get the following bound on r. 

P x ' y (t > t a+1 ) s$ B x ' y Ip^M? ( T > n ) 

<j pz.yrwW _ wj 2) < Ml + max P x '^'\t > n] 

ta ta 1 x' > y',x>-y> ^ M 

^ Y_u toj + max p. ,y t T>n ] (5.29) 

M x' > y',x'-y' s: M 



We apply it for M = J? . As we have 

v n 



E**[w£> - < 2) ] = (x - y) ( 1 - - ) < ir 4 ", (5.30) 

V re / \/n 



this gives 



d(t„+i) < e" a (l + o(\)) + max P x [r > re], (5.31) 



x' > y',x'-y' ^ kE a 



\7n 

and all that remains to do is estimating the second term. The next step is to show that 

P x '' y ' [t > re] < Q x '- y ' [t' > n] (5.32) 

where r' is the first hitting time of zero for a nearest neighbor symmetric random walk on 
Z with "jump rate" n/k 2 , starting from x' — y' (law Q x ~ y ). The result is rather intuitive, 
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as D t = wjp — has a drift towards zero and the probability of jumping is bounded 
from below by k 2 /n 2 . However, stochastic comparisons have to be made with some care 
in order to prove the result rigorously. We construct a coupling explicitly. 

We define Jj the sequence of moving time for (W^, W^) i.e. J = and for i ^ 

J i+1 := inf{i ^ Ji | (W t {1 \ w/ 2) ) + (WW,WV>)}. (5.33) 

We remark that {wf\ wf?)i> = (wf } , w/ 2) ) i ^ o is itself a Markov chain (something 
similar to the skeleton of a continuous time/discrete space Markov chain), and that condi- 
tionally to {Wj 1 ^, Wj 2 ^)i ^ o, (Ji+i — Ji)i > 0; h is a sequence of geometric variables of mean 

(2-P(W$;\ W$>)-P(WV> Mf))' 1 if £ Wf, and of mean (1 - P(W^ , W^))' 1 
if the processes have merged. 

Let (Ui)i ^ o an( i > o be two independent sequences of i.i.d. random variables. One 
constructs the process {W^\ w\ 2 ^)t > o starting from (x,y) deterministically from the 
sequences (C/j)j ^ o and (Z7^)j > o as follows (for the sake of simplicity, we do not give 
details of how the construction is done after the merging as we do not use it): 

• First, one constructs recursively {W^\ W^)i ^ o, (^o^'^o^) = { x iV) and 

(wf' + l.wf) ifUl +1 €[0,a(W^M 2) )] 
nr u Mr u M ._i (W^\WP-1) if^ +1 G(a(iy/ 1) ,^ (2) ),6(^ (1) ,^ (2) ))] 

( (iy«,wf + i) if^ +1 e( c (^« wf),i] 

where a(i,j), b(i,j), c(i,j) are chosen such that the chain has the right transition 
probabilities. It is important to notice that b(i,j) ^1/2 for all 

• Then, given (W- 1 \W- 2 ^)i ^ o we construct the sequence of moving times: Jo = 
and 

J l+1 -J t = m if U i+1 e [l-(l-<K^ ( ^ (2) )) m -\l-(l-^ (5-35) 

where q(i,j) := (2 — P(i,i) — P(j,j)) ^ k 2 /n 2 is the inverse of the mean jumping 
time from 

Using the same variables {JJi)i ^ o an d > o, one constructs a simple random walk on 
Z. 

• First, one defines Xi as 

i 

Xi = {x-y) + Y^ V- < 1/2} - !{^>i/2} • (5-36) 

3=1 

• Then, one defines Hi by Hq = and 

H i+1 - fTi = m if C/ i+ i € [l - (1 - fc 2 /n 2 , ) ro_1 , 1 - (1 - fc 2 /n 2 ) m ) . (5.37) 

• Finally, one sets 

X t = X i iSt€[H i ,H i+1 ). (5.38) 
From this construction one has that Hi ^ Ji and X t ^ - for all i. Therefore 
t' := inf{t \X t = 0} = H inl{i | x. =0} ^ J. nf{ . lW (i )=w (2 )} = t. (5.39) 
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By construction (Xt)t^o is a random walk with transition probability p(x,x ± 1) = 
k 2 /(2n 2 ) and p(x,x) = 1 — k 2 /n 2 , and therefore we proved (I5.32p . We now finish the 
proof of the main theorem. From (|5.32|) and (|5.31|) 

d(t a+ i) ^ e' a (l + o(l)) + [t' > n], (5.40) 

where Q m is the law of (Xt)t > o starting from m. Then from Proposition 15.31 

g'"*7!r1[ T / >n] <e _a (l + o(l)) (5.41) 

and therefore 

T mix (e) < -logn + nlog(e/2) + o(n). (5.42) 

Using the same technique one can get that there exists a function c(e) that goes to infinity 
when e goes to such that 

T mix (l - e) < - log n - c{e)n + o(n). (5.43) 

(this, together with the lower-bound part shows that the window of size 0(n) is an optimal 
result ([TIP]! ). 

5.3. Diffusion bounds. One is left with proving the approximation we used for the law 
of t' . Let n be a fixed integer. Let {Xt)t > o be a nearest-neighbor random walk on Z with 
transitions x ± 1) = q(n)/2, p(x, x) = l — q{n). We start our random walk from as(n) 
where s(n) and q{n) satisfy: 

s(n) 2 = q{n)n. (5.44) 

We denote Q x the probability associated to this random walk starting from x and Q = Q°. 
We want to estimate t', the hitting time of zero for this random walk. 

Proposition 5.3. One has for any given positive a and (3 

lim Q as ^ W > p n ] = -i= I e-^ds^^. (5.45) 

In fact the proof can almost be reduced to proving the following lemma. 
Lemma 5.4. For any positive integer m and n, 

Q m [ T ' > n ] = Q [X n G [-m + 1, m]] (5.46) 

Proof. We have 

QT \T>n\ = Y J Q m (V« G [l,n- 1] Xt >0,X n = j) . (5.47) 
i>o 

Moreover, 

Q m (Vt G [l,n-l] X, > 0,X n =i) 

= Q m (X n = j) - Q m (3t G [l,n- = 0, X n = j) 
= Q m (X n = j) - Q m [X n = -j] = Q[X n = j-m}-Q [Xn =j + m}. (5.48) 
The second inequality just comes from the application of the reflection principle. Therefore 

J2 Q m (Vi G [1, n — l]Xi >0,X n = j) = Q (X n G [-m + 1, m}) . (5.49) 

i>o 

□ 
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With the previous lemma, all one has to do is prove the convergence of X^ n /y/?ng(n) 
to a Gaussian variable. We do it by computing the Fourrier transform. For any fixed K 
we have that uniformly for all \t\ ^ K 



Q 









g V Pnq(n) 




1 - q(n) ^ 



fin 



/3n 



Therefore 

lim Q as ^ W > [3n] = lim Q 



2f3nq{n) 

-as(n) + 1, as(n)] 



= e~T(l + (l)). (5.50) 



y/nqn ^/3nq(n) 



= P 



a a 



. (5.51) 



where N (with law denoted by P) is a standard normal variable. 
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